Towards Balanced Defect Prediction with Better Information Propagation
نویسندگان
چکیده
Defect prediction, the task of predicting presence defects in source code artifacts, has broad application software development. prediction faces two major challenges, label scarcity, where only a small percentage artifacts are labeled, and data imbalance, majority labeled non-defective. Moreover, current defect methods ignore impact information propagation among this negligence leads to performance degradation. In paper, we propose DPCAG, novel model address above three issues. We treat as nodes graph, learn propagate influence neighboring iteratively an EM framework. DPCAG dynamically adjusts contributions each node selects high-confidence for augmentation. Experimental results on real-world benchmark datasets show that improves compare state-of-the-art models. particular, achieves substantial superiority when measured by Matthews Correlation Coefficient (MCC), metric is widely acknowledged be most suitable imbalanced data.
منابع مشابه
Towards Cross-Project Defect Prediction with Imbalanced Feature Sets
Cross-project defect prediction (CPDP) has been deemed as an emerging technology of software quality assurance, especially in new or inactive projects, and a few improved methods have been proposed to support better defect prediction. However, the regular CPDP always assumes that the features of training and test data are all identical. Hence, very little is known about whether the method for C...
متن کاملBalanced mitochondria behave better
Chen et al. fi nd that evening out mitochondrial fusion and fi ssion allows the organelles to regain their normal function. Mutations in genes that control mitochondrial fusion or fission are responsible for various diseases, including dominant optic atrophy. In all of these diseases, either fusion or fi ssion is defective, and one poten...
متن کاملEffects of Defect Propagation/Growth on In-Line Defect-Based Yield Prediction
This paper presents the importance of understanding defect propagation/growth and its impact on in-line yield prediction. In order to improve the prediction accuracy, impact of defect propagation and growth phenomena needs to be modeled and incorporated into yield prediction system. We developed a new yield prediction model by taking into account defect carryover. The empirical results of inter...
متن کاملIs"Better Data"Better than"Better Data Miners"? (On the Benefits of Tuning SMOTE for Defect Prediction)
We report and fix an important systematic error in prior studies that ranked classifiers for software analytics. Those studies did not (a) assess classifiers on multiple criteria and they did not (b) study how variations in the data affect the results. Hence, this paper applies (a) multi-criteria tests while (b) fixing the weaker regions of the training data (using SMOTUNED, which is a self-tun...
متن کاملTowards Better Understanding of Protein Secondary Structure: Extracting Prediction Rules
Although numerous computational techniques have been applied to predict protein secondary structure (PSS), only limited studies have dealt with discovery of logic rules underlying the prediction itself. Such rules offer interesting links between the prediction model and the underlying biology. In addition, they enhance interpretability of PSS prediction by providing a degree of transparency to ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2021
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v35i1.16157